Galaxy: A Platform for Explorative Analysis of Open Data Sources

نویسندگان

  • Seyed-Mehdi-Reza Beheshti
  • Boualem Benatallah
  • Hamid R. Motahari Nezhad
چکیده

A large volume of Open Data is being generated on a continuous basis. Examples of this are the case of social, natural, and information systems such as World Wide Web and social networks. Most entities and objects in the Open Data are interconnected, forming a complex, semi-structured, and information-rich networks. In this sense, Linked Open Data has the potential to be similar to a federated database. Since Linked Open Data is based on W3C standards, it is possible to implement a federation infrastructure, however, the current SPARQL standard makes it challenging to analyze the Open Data in an explorative manner. Consequently, it will be hard to discover the hidden knowledge in the relationships among entities in Open Data sources. In this paper, we present Galaxy, a platform for explorative analysis of Open Data Sources. Galaxy facilitates the analysis of Open Data graphs based on simple abstractions, i.e. folders and paths, which enable an analyst to group related entities in the graph or find paths among entities. Galaxy uses Hadoop data processing platforms to store and retrieve large numbers of RDF triples and to support cost-effective and Web-scale processing of Semantic Web data through a Folder-Path enabled extension of SPARQL.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

BioFuice: Mapping-Based Data Integration in Bioinformatics

We introduce the BioFuice approach for integrating data from different private and public data sources and ontologies. BioFuice follows a peer-topeer-like data integration based on bidirectional mappings. Sources and mappings are associated with a domain model to support a semantically meaningful interoperability. BioFuice extends the generic iFuice integration platform which utilizes specific ...

متن کامل

A Fuzzy TOPSIS Approach for Big Data Analytics Platform Selection

Big data sizes are constantly increasing. Big data analytics is where advanced analytic techniques are applied on big data sets. Analytics based on large data samples reveals and leverages business change. The popularity of big data analytics platforms, which are often available as open-source, has not remained unnoticed by big companies. Google uses MapReduce for PageRank and inverted indexes....

متن کامل

A Novel System-Level Calibration Method for Gimballed Platform IMU Using Optimal Estimation

An accurate calibration of inertial measurement unit errors is increasingly important as the inertial navigation system requirements become more stringent. Developing calibration methods that use as less as possible of IMU signals has 6-DOF gimballed IMU in space-stabilized mode is presented. It is considered as held stationary in the test location incorporating 15 di...

متن کامل

Detailed analysis of observed antiprotons in cosmic rays

In the present work, the origin of antiprotons observed in cosmic rays (above the atmosphere) is analyzed in details. We have considered the origin of the primaries, (which their interactions with the interstellar medium is one of the most important sources of antiprotons) is a supernova type II then used a diffusion model for their propagation. We have used the latest parameterization for anti...

متن کامل

On the Reachability of Trustworthy Information from Integrated Exploratory Biological Queries

Levels of curation across biological databases are widely recognized as being highly variable, depending on provenance and type. In spite of ambiguous quality, searches against biological sources, such as those for sequence homology, remain a frontline strategy for biomedical scientists studying molecular data. In the following, we investigate the accessibility of well-curated data retrieved fr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016